REPA: Applying Pathway Analysis to Genome-wide Transcription Factor Binding Data.

نویسندگان

  • Pranjal Patra
  • Tatsuo Izawa
  • Lourdes Pena Castillo
چکیده

Pathway analysis has been extensively applied to aid in the interpretation of the results of genome-wide transcription profiling studies, and has been shown to successfully find associations between the biological phenomena under study and biological pathways. There are two widely used approaches of pathway analysis: over-representation analysis, and gene set analysis. Recently genome-wide transcription factor binding data has become widely available allowing for the application of pathway analysis to this type of data. In this work, we developed REPA (Regulatory Enrichment Pathway Analysis) to apply gene set analysis to genome-wide transcription factor binding data to infer associations between transcription factors and biological pathways. We used the transcription factor binding data generated by the ENCODE project, and gene sets from the Molecular Signatures and KEGG databases. Our results showed that 54% of the predictions examined have literature support and that REPA's recall is roughly 54%. This level of precision is promising as several of REPA's predictions are expected to be novel and can be used to guide new research avenues. In addition, the results of our case studies showed that REPA enhances the interpretation of genome-wide transcription profiling studies by suggesting putative regulators behind the observed transcriptional responses.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Global Mapping of Transcription Factor Binding Sites by Sequencing Chromatin Surrogates: a Perspective on Experimental Design, Data Analysis, and Open Problems

Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have l...

متن کامل

An integrated workflow for analysis of ChIP-chip data.

Although ChIP-chip is a powerful tool for genome-wide discovery of transcription factor target genes, the steps involving raw data analysis, identification of promoters, and correlation with binding sites are still laborious processes. Therefore, we report an integrated workflow for the analysis of promoter tiling arrays with the Genomatix ChipInspector system. We compare this tool with open-so...

متن کامل

Systems assessment of transcriptional regulation on central carbon metabolism by Cra and CRP.

Two major transcriptional regulators of carbon metabolism in bacteria are Cra and CRP. CRP is considered to be the main mediator of catabolite repression. Unlike for CRP, in vivo DNA binding information of Cra is scarce. Here we generate and integrate ChIP-exo and RNA-seq data to identify 39 binding sites for Cra and 97 regulon genes that are regulated by Cra in Escherichia coli. An integrated ...

متن کامل

Genome-wide Association Study to Identify Genes and Biological Pathways Associated with Type Traits in Cattle using Pathway Analysis

Extended Abstract Introduction and Objective: Type traits describing the skeletal characteristics of an animal are moderately to strongly genetically correlate with other economically important traits in cattle including fertility, longevity and carcass traits. The present study aimed to conduct a genome wide association studies (GWAS) based on gene-set enrichment analysis for identifying the ...

متن کامل

A Biophysical Model for Analysis of Transcription Factor Interaction and Binding Site Arrangement from Genome-Wide Binding Data

BACKGROUND How transcription factors (TFs) interact with cis-regulatory sequences and interact with each other is a fundamental, but not well understood, aspect of gene regulation. METHODOLOGY/PRINCIPAL FINDINGS We present a computational method to address this question, relying on the established biophysical principles. This method, STAP (sequence to affinity prediction), takes into account ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE/ACM transactions on computational biology and bioinformatics

دوره   شماره 

صفحات  -

تاریخ انتشار 2015